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(54) Title: NEW SECRETORY LEADER SEQUENCES 



(57) Abstract 



Secretory leader sequences, for use in secreting heterologous polypeptides in yeast, are formed by fusing part of the human 
serum albumin pre-sequence or part of the Kluyveromyces lactis killer toxin pre-sequence to the Saceharomyces cerevisiae mating 
factor aipha-1 KEXl cleavage recognition site. The resulting fusion leader sequences are: (a) HjN-Met-Lys-Trp-Val-Ser-Phe- 
Ile-Ser^Leu-Leu-Phe-Leu-Phe-Ser-Ser-Ala-Tyr-Ser-Arg-Ser-Leu-Asp-Lys-Arg-COOH or (b) H 2 N-Met-Asn-Ile-Phe*Tyr-Ile- 
Phe-Leu-Phe-Leu-Leu-Ser~Phe-Val-Gln-Gly-Ser- Leu- Asp-Lys- Arg-COOH. Conservative variations are also encompassed. 
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New secret ory lead e r sequences . 



This invention relates to secretory leader sequences 
which can be employed to direct the secretion of a 
heterologous protein (such as human serum albumin) from 
fungi (for example the yeast Saccharomyces cerevi siae) . 

Translocation of protein molecules through bi-lipid 
membranes from one cellular compartment to another 
generally relies upon information held within the primary 
amino acid sequence of the protein itself . The most 
prevalent and therefore the best characterised sequence 
information is the amino terminal leader or signal 
sequence of prokaryotic and eukaryotic organisms . Genetic 
studies in which the signal sequence has been totally or 
extensively deleted indicate that the signal sequence is 
essential for protein translocation (Benson, S.A. et al . 
1985, Ann. Rev. Biochem. 54 , 101-134 ) . Among several 
hundred known sequences (Watson, M.E.E. , 1984 , Nuc • Acid, 
Res . 12 , 5145-5164 ) no consensus signal sequence or even 
an absolute requirement for any amino acid at any given 
position can be discerned, although a common feature of 
many leader sequences is a core of 7-10 hydrophobic amino 
acids . Genetic manipulations which result in alterations 
to the hydrophobic core, either by deletion or by 
inserting charged residues, generally result in a block 
in protein trans location ( Benson , S > A. , et al . 1985 , Ann. 
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Rev* Biochem- 54 , 101-134 ) ♦ Moreover , in a series of 
extensive modifications to the chicken lysozyme leader 
sequence, Yamamoto et al. 1987 (Biochem. and Biophys . 
Res . Comm. 149 , 431-436 ) have shown that, while some 
alterations to the hydrophobic core can result in the 
abolition of secretion, others can potentiate the leader 
sequence function , resulting in increased levels of 
protein secretion. 

While the leader sequence is usually essential for 
the translocation of proteins across membranes , once 
trans located these sequences are usually endoproteo- 
lytically cleaved by enzymes contained within the 
cellular compartments into which the proteins have now 
moved . These enzymes recognise specific amino acid 
sequences within the primary structure of the 
translocated protein . Moreover, complete processing of 
certain eukaryotic proteins to their mature form often 
relies upon a series of proteolytic cleavages (Bussey, 
H. , 1988 Yeast ±, 17-26) . 

With the recent advances in recombinant DNA 
technology, increasing resources have been brought to 
bear on the commercial exploition of fungi , particularly 
yeasts , as vehicles for the production of a diverse range 
of proteins • 
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Since many of these proteins are themselves 
naturally secreted products, it is possible to utilise 
the information contained within the leader sequence to 
direct the protein through the secretion pathway. 
However, this information is contained within a peptide 
foreign to yeast. Its recognition and subsequent 
processing by the yeast secretory pathway are not 
necessarily as efficient as those of a homologous yeast 
leader sequence. As a consequence an alternative approach 
has been to replace the leader sequence with one derived 
from a naturally secreted yeast protein . 

The most widely used yeast secretory sequence is the 
89 amino acid leader sequence of the alpha-factor mating 
pheromone. Processing of this leader has been extensively 
studied (Kurjan & Herskowitz, Cell 30, 933-943, 1982; 
Julius et al. 1983 Cell 32, 839-852; Dmochowska* et al. 
Cell 50, 573-584, 1987; Julius et al. Cell 36: 309-318, 
1984; Julius et al. Cell 37, 1075-1085, 1984) and 
requires at least four gene products for complete 
proteolytic cleavage to liberate the mature 13 amino acid 
alpha-factor pheromone. 

Complete proteolytic cleavage of the alpha-factor 
primary translation product requires first the removal of 
the N- terminal 19 amino acid signal sequence by a signal 
peptidase within the endoplasmic reticulum. Following 
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this the sequential action of three gene products located 
within the golgi apparatus processes the large precursor 
molecule, liberating four copies of the alpha-factor 
pheromone . These are the KEX2 gene product , an 
endopeptidase which cleaves after the Lys-Arg dibasic 
amino acid pair, a carboxypeptidase £-like cleavage, 
recently identified as the product of the KEX1 gene , and 
a dipeptidyl amino peptidase, the product of the STE13 
gene, which s equen t i a 1 ly remove s the Glu-Ala or Asp-Ala 
diamino acid pairing preceding the mature alpha-factor 
pheromone . 

The alpha factor prepro leader sequence has 
successfully been employed to secrete a range of diverse 
proteins and peptides . However , when the alpha-factor 
signal is used to direct secretion of human serum 
albumin , we have found that a large proportion of the 
extracellular HSA produced is in the form of a 45KD 
N-terminal fragment, 

EP-A-252 561 (Sclavo) discloses the use of the 16 
amino acid signal peptide (pre- sequence ) from the killer 
toxin of Kluyveromyces lactis to aid secretion of 
heterologous proteins in yeast. 



.5 

A further possibility is to use a fusion secretory 
leader sequence. This may be generated by the fusion of 
two independent sequences . A hybrid signal in which the 
first amino acids of the acid phosphatase signal were 
fused to the proteolytic cleavage site of human alpha 
interferon resulted in the expression and secretion of 
interferon ( Hinnen et al. Foundation for Biochemical and 
Industrial Fermentation Research, 229 , 1219-1224, 1983 ) ; 
10% of the interferon produced was secreted into the 
medium . In a similar approach the first 22 amino acids of 
the alpha-factor leader were fused to the last twelve 
amino acids of the human interferon alpha- 2 signal 
sequence resulting in the secretion of interferon alpha-2 
into the culture supernatant ( Piggott et al. Curr . Genet . 
12 561-567 , 1987). An identical construct in which the 
int er f eron alpha- 2 gene was replaced by the interferon J3 
gene did not result in any secretion of human interferon 
J3 into the culture supernatant . Finally, in a series of 
experiments designed to assess the effect of leader 
sequences on the secretion of human lysozyme, Yoshimura 
et al . (Biochem. & Biophys . Res . Comm. 145, 712-718, 
1987) described a fusion leader comprising the first 9 
amino acids of the chicken lysozyme leader and the last 9 
amino acids of the Aspergillus awamori glycoamylase 
leader. Although this fusion leader was effective in 
secreting 60% of the produced material into the culture 
supernatant, it was only 15% as effective as the entire 
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chicken lysozyme leader. Moreover, no secreted product 
could be detected if the human lysozyme sequences were 
preceded by the entire Aspergillus qlvcoamylase leader r 
or a fusion derived from the first 9 amino acids of the 
Aspergillus glucoamylase leader and the last 9 amino 
acids of the chicken lysozyme leader. 

We have now devised new and advantageous leader 
sequences for use in fungi ♦ 

One aspect of the invention provides an amino acid 
sequence as follows: 

(a) H2N-Met-Lys-Trp-Val-Ser^Phe-Ile-Ser-Leu-Leu«Phe-Leu- 
Phe-Ser-Ser-Ala-Tyr-Ser-Arg-Ser-Leu-Asp-Lys^Arg-COOH 

or 

(b) H2N-Met-Asn-Ile-Phe-Tyr-Ile-Phe-Leu-Phe-Leu-Leu-Ser- 
Phe-Val-Gln-Gly-Ser-Leu-Asp-Lys-Arg-COOH 

or conservatively modified variations of either sequence . 

Table 1 shows alternative amino acids for each 
position except the initial methionine . Any of the 
possible permutations are within the scope of the 
invention . The selection of lysine or arginine for the 
last two positions is particularly non-critical, although 
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there should always be Lys or Arg at each of these 
positions . Preferably, positions 20 and 21 of sequence 
(a) are not Gly and Val respectively. Sequences which 
are up to four amino acids shorter or longer are also 
included provided that the C-terminal (Lys, Arg) , Lys -Lys 
or Arg-Arg entity is maintained , there is a positively 
charged residue within 5 residues of the N-terminus and 
there is a generally hydrophobic region at or adjacent 
the middle of the sequence. 
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Table 1 
Leader fa) 



1 

Met Lys Trp Val Ser 
Arg Phe Leu Thr 
His Tyr He Gly 
Gin Met Ala 

Asn 

Ser Ala Tyr Ser Arg 
Thr Thr Phe Thr Lys 
Gly Gly Trp Gly His 
Ala Ser Ala Gin 

Asn 



10 

Phe He Ser Leu Leu Phe Leu Phe Ser 
Trp Leu Thr He He Trp He Trp Thr 
Tyr Val Gly Val Val Tyr Val Tyr Gly 
Met Ala Met Met Met Ala 



20 

Ser Leu Asp Lys Arg 
Thr He Glu Arg Lys 
Gly Val Asn 
Ala Met Gin 
His 



9 

Leader (b) 

Met Asn lie Phe Tyr lie Phe Leu Phe Leu Leu Ser Phe Val 
Asp Leu Trp Phe Leu Trp lie Trp lie lie Thr Trp Leu 
Glu Val Tyr Trp Val Tyr Val Tyr Val Val Gly Tyr lie 
Gin Met Met Met Met Met Ala Met 

■ His 

Gin Gly Ser Leu Asp Lys Arg 
Asp Ser Thr lie Asn Arg Lys 
Asn Thr Gly Val Glu 
Glu Ala Ala Met Gin 
His His 

A second aspect provides a fusion compound 
comprising any of the said amino acid sequences linked , 
preferably directly, at the carboxyl terminal to the N- 
terminal residue of a polypeptide. The polypeptide may be 
any desired polypeptide, including "pro-polypeptides " (in 
other words precursors which undergo pos t- trans lational 
cleavage or other modification, such as glycosylation ) . 
The term "polypeptide " encompasses oligopeptides . The 
polypeptide may be fibronectin or a portion thereof (for 
example the collagen or fibrin-binding portions described 
in EP 207 751), urokinase, pro -urokinase, the 1-368 
portion of CD4 ( D • Smith et al (1987) Science 328, 1704- 
1707 ) , platelet derived growth factor (Collins et al 
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(1985) Nature 316, 748-750), transforming growth factor 0 
(Derynck et al (1985) Nature 316, 701-705), the 1-272 
portion of Von Willebrand's Factor (Bontham et al, Nucl. 
Acids Res. 14 7125-7127), the Cat heps in p fragment of 
fibronectin (585-1578), a x -anti trypsin, plasminogen 
activator inhibitors , factor VIII, a-glpbin, 0-globin, 
myoglobin or nerve growth factor or a conservative 
variant of any of these. The polypeptide may also be a 
fusion of HSA or an N-terminal portion thereof and any 
other polypeptide, such as those listed above. 
Preferably, the polypeptide is a naturally-occurring 
human serum albumin , a modified human serum albumin or a 
fragment of either, such modified forms and fragments 
being termed "variants". These variants include all forms 
or fragments of HSA which fulfill at least one of the 
physiological functions of HSA and which are sufficiently 
similar to HSA, in terms of structure (particularly 
tertiary structure ) as to be regarded by the skilled man 
as forms or fragments of HSA. 



In particular variants or fragments of HSA which 
retain at least 50% of its ligand-binding properties, for 
example with respect to bilirubin Or fatty acids, 
(preferably 80%, or 95%) are encompassed. Such 
properties are discussed in Brown, J.R . & Shockley, p . 
(1982) in Lipid -Protein Interactions 1, 26-68, Ed. Jost, 
P.C. & Griffith, O.H. 
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The portion of HSA disclosed in EP 322 094 is an 
example of a useful fragment of HSA which may be secreted 
by use of the leader sequences of the invention. 

A third aspect provides a nucleotide sequence coding 
for any of the said amino acid sequences or for the said 
fusion compound . The nucleotide sequence ( or the portion 
thereof encoding the leader sequence) may be selected 
from the possibilities shown in Tables 2 & 3 , for 
sequences (a) and (b) respectively, where the codons 
encoding each amino acid are listed under the amino 
acids. The codons of Tables 2 and 3 clearly relate to 
RNA, but it is to be understood that equivalent DNA 
nucleotide sequences are also within the scope of this 
aspect of the invention. 
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Table 2 



Met 


Lys 


Trp 


Val 


Ser 


Phe 


lie 


Ser 


Leu 


Leu 




Leu 


Phe 


Ser 


AUG 


AAA 


UGG 


GUU 


UCU 


UUU 


AUU 


UCU 


UUA 


UUA 


UUU 


UUA 


UUU 


UCU 




AAG 




GUC 


UCC 


UUC 


AUC 


UCC 


UUG 


UUG 


UUC 


UUG 


UUC 


UCC 








GUA 


UCA 




AUA 


UCA 


CUU 


CUU 




CUU 




UCA 








GUG 


UCG 






UCG 


cue 


cue 




cue 




UGG 










AGU 






AGU 


CUA 


CUA 




CUA 




AGU 










AGC 






AGC 


CUG 


CUG 




CUG 




AGC 


Ser 


Ala 


Tyr 


Ser Arg 


Ser 


Leu 


Asp 


Lys 


Arg 










UCU 


GCU 


UAU 


UCU 


CGU 


UCU 


UUA 


GAU 


AAA 


CGU 










UCC 


GCC 


UAC 


UCC 


CGC 


UCC 


UUG 


GAC 


AAG 


CGC 










DCA 


GCA 




UCA 


CGA 


UCA 


CUU 






CGA 










UCG 


GGG 




UCG 


CGG 


UCG 


cue 






CGG 










AGU 






AGU 


AGA AGU 


CUA 






AGA 










AGC 






AGC 


AGG 


AGC 


CUG 






AGG 
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Table 3 

Met Asn lie Phe Tyr lie Phe Leu Phe Leu Leu Ser Phe Val 
AUG AAU AUU UUU UAU AUU UUU UUA UUU UUA UUA UCU UUU GUU 
AAC AUG UUC UAC AUC UUC UUG UUC UUG UUG UCC UUC GUC 
AUA AUA GUU CUU CUU UCA GUA 

CUC CUC CUC UCG GUG 

CUA CUA CUA AGU 

CUG CUG CUG AGC 

Gin Gly Ser Leu Asp Lys Arg 
CAA GGU UCU UUA GAU AAA CGU 
CAG GGC UCC UUG GAC AAG CGC 
GGA UCA CUU CGA 
GGG UCG CUC CGG 
AGU CUA AGA 
AGC CUG AGG 



A fourth aspect provides a DNA construct comprising a 
suitable control region or regions and a nucleotide 
sequence as defined above, the sequence being under the 
control of the control region . By " suitable control 
region " we mean such DNA regions as are necessary to 
enable the said nucleotide sequence to be expressed in 
the host for which the construct is intended . The control 
region will usually include transcriptional start and 
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stop sequences, 3 ' -polyadenylation sequences, a promoter 
and, often, an upstream activation site for the promoter. 
The man skilled in the art will readily be able to select 
and assemble suitable regions from those available in 
this art. However, specific examples of suitable 
expression vectors and their construction include those 
disclosed in EP 198 745, GB 2 171 703 (for B.subtilis ) , 
EP 207 165, EP 116 201, EP 123 244, EP 123 544, EP 147 
198, EP 201 239, EP 248 637, EP 251 744, EP 258 067, EP 
286 424 and EP 322 094. 

A fifth aspect provides a host transformed with the 
said DNA construct . The host may be any host in which the 
construct is found to work adequately, including 
bacteria, yeasts , filamentous fungi, insect cells , plant 
cells and animal cells. Preferably, however, the host is 
Saccharomyces cerevisiae or Schizosaccharomyces pombe, 
most preferably the former. As many native secretion 
signals are effective in heterologous hosts (for example 
the natural HSA leader sequence in yeast) it is entirely 
reasonable to suppose that the leader sequences of the 
invention will function in hosts other than yeasts . 

A sixth aspect provides a process for preparing a 
polypeptide , comprising cultivating the said host and 
obtaining therefrom the polypeptide expressed by the said 
nucleotide sequence, or a modified version thereof. 
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By "modified version thereof", we mean that the actual 
polypeptide which is separated may have been post- 
translationally modified, in particular by cleavage of 
the leader sequence, 

A seventh aspect provides a polypeptide prepared by 
such a process. 

So that the invention may be more readily 
understood, preferred aspects will now be illustrated by 
way of example and with reference to the accompanying 
drawings in which: 



Figure 1 is a restriction map of plasmid pEK113; 
Figure 2 is a restriction map of plasmid pEK25; 
Figure 3 is a restriction map of plasmid pAYE230; 
Figure 4 is a restriction map of plasmid pAYE238; 
Figure 5 is a restriction map of plasmid pAYE305 ; 



and 



Figure 6 is a restriction map of plasmid pAYE305 . 
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Example of a prior art type of leader sequence 

The DNA coding sequence for mature HSA protein has 
been placed immediately downstream of a DNA sequence 
encoding the KEX2 cleavage site of the alpha factor pre 
pro leader sequence (85 amino acids ) . When this protein 
sequence is placed under the control of a promoter on a 
yeast autonomously replicating plasmid and transformed 
into a haploid strain of the yeast Saccharomyces 
cerevisiae, mature HSA can be detected in the culture 
supernatant . N-terminal amino acid sequence information 
indicates that the secreted protein has the same N-- 
terminal amino acid composition as natural HSA, namely 
Asp-Ala-His • This also indicates that the first two amino 
acids of the secreted HSA are not susceptible to the 
dipeptidyl endopeptidase, the product of the STE13 gene, 
as this enzyme is responsible for the removal of such 
sequences from between successive repeats of the alpha- 
factor pheromone. Although mature HSA is the major 
product observed in the culture supernatant, a N-terminal 
fragment of HSA (45 kilodaltOns ) was also detected, 
representing approximately 15% of the total HSA 
synthesisecL This fragment c omponent r epr e sents not only 
a waste of secretion capacity but also certain downstream 
purification problems in that, as a fragment of HSA, it 
shares some biochemical and biophysical properties with 
intact HSA. 
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EXAMPLE 1 

We have constructed a fusion leader which may be 
regarded as the natural HSA leader sequence from which 
the last five amino acids have been removed , to be 
replaced by the five amino acids preceding the KEX2 
cleavage site of the alpha-factor pre pro leader 
sequence, i.e. amino acids 81 to 85 are Ser-Leu-Asp-Lys- 
Arg (Table 2). 

When transformed with suitable plasmid vectors 
incorporating the fusion leader, yeast secrete mature HSA 
into the culture supernatant at levels comparable to that 
observed with the alpha-factor leader sequence- N- 
terminal sequence analysis indicates that the mature HSA 
possesses the correct N-terminal amino acid composition. 

Moreover, substitution of the alpha-factor leader by 
the fusion leader sequence has been found to result in a 
6 fold reduction in the levels of the 45 kd fragment 
observed in the culture supernatant . This therefore 
represents a significant improvement in the reduction of 
the contaminating polypeptides , thus aiding the 
purification of mature HSA from yeast culture 
supernatants . 
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Details 



Unless otherwise stated all procedures were carried 
out as described by Maniatis et al (1982). Plasmid pEK113 
(Figure 1) (EP-A-248 637 ) was digested to completion with 
the restriction endonuc leas es Mstll and HindTII . DNA was 
recovered by phenol/ chloroform extraction and ethanol 
precipitation . The linearised plasmid DNA was then 
treated with the Klenow fragment of E. coll DNA polymerase 
I to generate a linearised DNA molecule with blunt ends . 



The following oligonucleotide duplex (I) was 
constructed on an automated Applied Biosystems Inc 3 8 OB 
DNA synthesiser ( according to manufacturer ' s 
instructions ) . 

Oligonucleotide I 

5' " 3' 

GGC TTA TAA GGA TCC TTA TAA GCC 
CCG AAT ATT CCT AGG AAT ATT CGG 



The o ligonuc leo t ide duplex was ligated with 
equimolar quantities of linearised, blunt ended pEKll3V 
E . coli strain MC1061 was transformed with the ligation 
mixture and cells receiving DNA were selected on an 
ampicillin-containing medium (50ug/ml ampicillin) . 
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Recombinant plasmids containing the oligonucleotide 
duplex were screened by digesting DNA prepared from 
individual colonies with the restriction endonucleases 
Mst.II and EcoRI . Plasmid pEK25 was thus formed (Figure 
2). 

Plasmid pEK25 was digested to completion with the 
restriction endonucleases Xba l and BamHI , DNA fragments 
were separated by electrophoresis through a 1% (w/v) 
agarose gel and a 688 base pair Xba l - BamHI DNA fragment 
recovered from the gel by electroelution. 

The plasmid mpi9.7 (EP-A-24 8 637) was digested to 
completion with the restriction endonuclease Xhol . The 
linearised DNA was phenol/chloroform extracted and 
ethanol precipitated. The recovered DNA was then treated 
with the Klenow fragment of E. coli DNA polymerase I as 
previously described, following which the DNA was 
phenol/chloroform extracted and ethanol precipitated. The 
recovered DNA was then digested to completion with Xba l 
and the digestion products separated by agarose gel 
electrophoresis . A 1067 base pair fragment was recovered 
from the gel by electroelution . The following oligo- 
nucleotide duplex ( II ) was prepared as described 
previously. 
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5' 

GATCC ATG AAG TGG -GTA AGC TTT ATT TCG CTT CTT TTT GTC 
TAC TTC ACC CAT TCG AAA TAA AGG GAA GAA AAA. GAG 

' . : ; - 3:/ , ; - 
TTT AGC TCG GCT TAT TCC AGG AGC TTG GAT AAA AG A 
AAA TCG AGC CGA ATA AGG TCC TCG AAC CTA TTT TCT 

The plasmid pUC19 ( Yanisch-Perron et al. 1985) was 
digested to completion with the restriction endonuclease 
BamH I . Linearised DNA was recovered by phenol / chloro form 
extraction and ethanol precipitation. 

Equimolar quantities of the BamHI digested pUC19 r 
the oligonucleotide duplex II, the 1067 b.p. DNA fragment 
derived from mpl9.7 and the 688 b.p. DNA fragment derived 
from pEK25 were ligated together . E.coll DH5 was trans- 
formed with the ligated DNA and trans f ormant s selected on 
50ug/ml ampicillin L-broth agar . Recombinant colonies 
containing the desired plasmid , designated pAYE 230 
(Figure 3) were selected by digested DNA obtained from 
individual colonies with the restriction endonuclease 
BamHI. 
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Plasmid pAYE 230 was digested to completion with 
BamHI and the products separated by electrophoresis 
through a 1 % agarose gel . The 1832 base pair fragment 
containing the HSA coding sequence was recovered by 
electroelution . 

Plasmid pMA91 (Mellor et al, 1983) was digested to 
completion with Bglll under standard- conditions . The 
linearised plasmid was phenol /chloroform extracted and 
ethanol precipitated* 

Equivalent quantities of the linearised pMA91 and 
the DNA fragment prepared from pAYE 230 were ligated 
under standard conditions . E . coli DH5 was transformed 
with the ligation mixture and cells receiving the DNA 
selected on L-broth agar containing 5.0 jig /ml ampicillin. 
Colonies containing the desired plasmid, designating pAYE 
238 ( Figure 4) were selected by digesting the DNA from 
such colonies with PvuII • 

Plasmid pAYE 238 was transformed into the yeast 
Saccharomyces cerevisiae strain S150-2B as described by 
Hinnen et al . (1978). Cells receiving plasmid pAYE 238 
were selected on minimal medium, supplemented with 2% 
(w/v) glucose, 20mg/l histidine, 2 Omg/1 tryptophan and 
20mg/l uracil . 
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Transformed S150-2B cells were transferred to 10ml 
YEPD media containing 2% (w/v) glucose and incubated at 
30°C, 200rpm for 72 hours. Cell free culture supernatants 
were analysed by discontinuous native 8-25% gradient 
polyacrylamide gel electrophoresis on a Pharmacia Phast 
System, as described in the manufacturer's instructions . 
Cells were stained and destained and the relative 
quantities of native HSA and HSA fragment estimated by 
gel scan at 595nm. 



EXAMPLE 2 

We have also constructed a second fusion leader which 
consists of the 16 amino acid pre region of the 97/000 
dalton Kluyveromyces lactis killer ( ORF 2 ) toxin (Stark 
and Boyd, 1986, Tokumaga et al 1987) fused to the five 
amino acids preceding the KEX2 cleavage site of the 
alpha-factor prepro leader sequence, i.e. amino acids 81 
to 85, Ser-Leu-Asp-Lys-Arg (Table 3). 

When transformed with plasmid vectors incorporating the 
fusion leader described in Table 3, yeast secreted mature 
HSA into the culture supernatants at levels higher than 
when either the natural K. lactis prepro killer toxin 
leader sequence or the alpha-factor prepro leader 
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sequence was used. N-terminal sequence analysis 

indicates that the mature HSA possesses the correct N- 
terminal amino acid composition. 

Substitution of the alpha-factor leader by the K.lactis 
killer /alpha factor fusion leader sequence resulted in a 
six fold reduction in the levels of the 45kd fragment 
observed in the culture supernatant . This therefore 
represents a significant improvement in the reduction of 
the contaminating polypeptides, thus aiding the 
purification of mature HSA from yeast culture 
supernatants . 

Details 

The experimental procedures employed to generate a yeast 
HSA secretion vector utilising the K.lactis killer/alpha 
factor fusion leader were identical to those described in 
Example 1 , except that oligonucleotide duplex (II) was 
replaced by oligonucleotide duplex (III) synthesised on 
an automated Applied Biosys terns Inc. 3 8 OB DNA synthesiser 
(according to manufacturer's instructions) . 



Oligonucleotide duplex III 

GATGC ATG AAT ATA TTT TAC ATA TTT TTG TTT TTG CTG TCA TTC 
TAC TTA TAT AAA ATG TAT AAA AAC AAA AAC GAC AGT AAG 
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GTT CAA GGA AGC TTG GAT AAA AGA 
CAA GTT CCT TCG AAG CTA TTT TCT 

Equimolar quantities of the BamH I digested pUC19, the 
oligonucleotide duplex 111, the 106 7bp DNA fragment 
derived from mpl9 .7 and the 688b. p. DNA fragment derived 
from pEK25 were ligated together . E.coli DH5 was 
transformed with ligated DNA and trans formants selected 
on 50|ig/ml ampicillin L-broth agar* Recombinant colonies 
containing the desired plasmid, designated pAYE3 04 
(Figure 5) , were selected by digested DNA obtained from 
individual colonies with the restriction endonuc lease 
BamHI, ■ " - . 



Plasmid pAYE304 was digested to completion with BamHI and 
the products separated by electrophoresis through a 1% 
agarose gel. The 1823 base pair fragment containing the 
HSA coding sequence was recovered by electroelution . 

Plasmid pMA91 (Mellor et al, 1983) was digested to 
completion with Bgl ll under standard conditions . The 
linearised plasmid was phenol/chloroform extracted and 
ethanol precipitated • 

Equivalent quantities of the linearised pMA9 1 and the DNA 
fragment prepared from pAYE304 were ligated under 
standard conditions . E . coli DH5 was transformed with the 
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ligation mixture and cells receiving DNA selected on L- 
broth agar containing SOj-ig/ml ampicillin. Colonies 
containing the desired plasmid , designating pAYE305 
(Figure 6), were selected by digesting the DNA from such 
colonies with PvuII . 



Plasmid pAYE305 was transformed into the yeast 
Saccharomyces cerevisiae strain S150-2B as described by 
Hinnen et al, (1978). Cells receiving plasmid pAYE305 
were selected on minimal medium, supplemented with 2 % 
(w/v) glucose, 20mg/l histidine, 20mg/l tryptophan and 
20mg/l uracil . 

Transformed S150-2B cells were transferred to 10ml YEPD 
medium containing 2% (w/v) glucose and incubated at 30°C, 
200rpm for 72 hours. Cell free culture supernatants were 
analysed by discontinuous native 8-25% gradient 
polyacrylamide gel elec trophores i s on a Pharmacia Phast 
System, as described in the manufacturer ' s instructions . 



Cells were stained and destained and the relative 
quantities of native HSA and HSA fragment estimated by 
gel scan at 595nm. 

EXAMPLE 3 



Using a vector based on the disintegration vectors of 
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EP286424 (Delta Biotechnology), a suitable prompter and 
the fusion leader of Example 1 above , Schizosaccharomyces 
pombe (strain Leul »32h) was transformed and fermented at 
30°C in 10ml of EMM (Edinburgh minimal medium , Ogden , 
J.E. & Fantes , P. A. (1986) Curr. Genetics 10 509-514) r 
buffered to pH 5.6 with 0.1M citric acid/ sodium 

phosphate , to give 10-15 mg/1 of HSA in the culture 
supernatant after 3 days. 
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CLAIMS 

1 • Aa amino acid sequence as follows : 

(a) H2N-Met-Lys-Trp-Val-Ser-Phe-Ile-Ser-Leu-Le\i-Phe-Leu- 
Phe-Ser-Ser-Ala-Tyr-Ser-Arg-Ser-Leu-Asp-Lys-Arg-COOH 

or 

( b) H2N-Met-Asn~Ile-Phe-Tyr-Ile-Phe-Leu-Phe~Leu-Leu-Ser- 
Phe-Val-Gln-Gly-Ser-Leu-Asp-Lys-Arg-COOH 

or conservatively modified variants of either sequence. 

2. A fusion compound comprising an amino acid sequence 
according to Claim 1 linked at the carboxyl terminal to 
the N-terminal residue of a polypeptide . 



3. A fusion compound according to Claim 2 wherein the 
said amino acid sequence is linked directly to said 
polypeptide. 



4. A fusion compound according to Claim 3 wherein the 
polypeptide is a naturally-occurring human serum albumin , 
a modified human serum albumin or a fragment of either. 



5 . A nucleotide sequence coding for the amino acid 
sequence of Claim 1 or for a fusion compound according to 
Claim 2. 



WO 90/01063 



PCT/GB89/00816 



30 

5 . A nucleotide sequence according to Claim 5 selected 
from the sequences shown in Table 2 or 3. 

7. A DNA construct comprising a suitable control region 
or regions and a nucleotide sequence according to Claim 5 
or 6, the sequence being under the control of the control 
region . 

8. A host transformed with a DNA construct according to 
Claim 7 . 

9 ♦ Saccharomyces cerevisiae or Schizosaccharomyces 
pombe according to Claim 8. 



10. A process for preparing a polypeptide , comprising 
cultivating a host according to Claim 8 or 9 and 
obtaining therefrom the polypeptide expressed by the said 
nucleotide sequence or a modified version thereof. 

11 . A polypeptide prepared by a process according to 
Claim 10. 

12 • Human serum albumin or a variant thereof prepared by 
a process according to Claim 10 . 
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